PROTEIN SECRETION 
Background of the Invention 

Field of the Invention 

The present invention relates generally to the field of protein production by 
5 cell culture. In particular aspects, the invention provides methods and compositions 
for the production of polypeptides, especially by eukaryotic cell culture, which 
enhance secretion and facilitate recovery of the polypeptides. 
Description of Related Disclosures 

Many eukaryotic cell surface- and secreted proteins are post-translationally 

10 processed to incorporate N-linked and O-linked carbohydrate (Kornfeld and 

Kornfeld (1985) Annu. Rev. Biochem. 54:631-64; Rademacher et aL, (1988) Annu. 
Rev. Biochem. 57:785-838). Protein glycosylation is thought to subserve a variety 
of functions including: augmentation of protein folding, inhibition of protein 
aggregation, regulation of intracellular trafficking to organelles, increasing 

1 5 resistance to proteolysis, modulation of protein antigenicity, and mediation of 

intercellular adhesion (Fieldler and Simons (1995) Cell 81:309-312; Helenius (1994) 
Mol. Biol, of the Cell 5:253-265; Olden et aL, (1978) Cell, 13:461-473; Caton et 
aL 9 (1982) Cell 37:417-427; Alexander and Elder (1984) Science 226:1328-1330; 
Flack et aL. (1994 J. Biol. Chem. 269:14015-14020). In higher organisms, the 

20 nature and extent of glycosylation can markedly affect the circulating half-life and 
bio-availability of secreted proteins by mechanisms involving receptor mediated 
uptake and clearance (Ashwell and Morrell (1974) Adv. Enzymol. 41 :99-128; 
Ashwell and Harford (1982) Ann. Rev. Biochem. 51:531-54). Receptor systems 
have been identified that are thought to play a major role in the clearance of serum 

25 proteins through recognition of various carbohydrate structures on the glycoproteins 
(Stockert (1995) Physiol. Rev. 75:591-609; Kery et aL., (1992) Arch. Biochem. 
Biophys. 298:49-55). Thus, production strategies resulting in incomplete attachment 
of terminal sialic acid residues might provide a means of shortening the 
bioavailability and half-life of secreted glycoproteins by promoting clearance by the 

30 hepatic asialoglycoprotein receptor. Conversely, expression strategies resulting in 
saturation of terminal sialic acid attachment sites might lengthen protein 
bioavailability and half-life by preventing uptake by hepatic receptors. In the 
development of recombinant glycoproteins for use as pharmaceutical products, it has 
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been speculated that the pharmacodynamics of recombinant proteins can be 
modulated by the addition or deletion of glycosylation sites from a glycoprotein's 
primary structure (Berman and Lasky (1985a) Trends in Biotechnol. 3:51-53). 
However, studies have reported that the deletion of N-linked glycosylation sites 
5 often impairs intracellular transport and results in the intracellular accumulation of 
glycosylation site variants (Machamer and Rose (1988) J. Biol. Chem. 263:5955- 
5960; Gallagher et al, (1992) J. Virology 66:7136-7145; Collier et a/.., (1993) 
Biochemistry 32: 7818-7823; Claffey et al, (1995) Biochemica et Biophysica Acta 
1246 :l-9; Dube et aL, (1988) J. Biol. Chem. 263:17516-17521). While 
10 glycosylation site variants of secreted proteins can be expressed intracellular^, it 
has proved difficult to recover useful quantities from growth conditioned cell culture 
medium. 

It has been shown that impaired secretion of the HIV-1 envelope 
glycoprotein containing N-linked carbohydrate structures can be partially overcome 

15 by replacing the native signal sequence of the glycoprotein with the signal sequence 
and 5' untranslated region of the herpes simplex virus type 1 glycoprotein D (HSV 
gD-1) (Lasky et aL, (1986) Science 233:209-212; Berman et aL, (1989) J. Virol. 
63:3489-3498). Studies have reported the use of the native secretory peptide of 
human tissue type plasminogen activator in cell culture production of non-t-PA 

20 glycoproteins. (International Publication No. WO 96/1 7067; Krasney and Young 

(1992) Cytokine 4:134-143; Rhodes et aL, (1994) J. Gen. Virol. 75:207-213). None 
have reported a strategy for the secretion and recovery of proteins wherein one or 
more native glycosylation sites have been added to or deleted, from the protein's 
primary structure. 

25 Summary of the Invention 

The present invention provides methods and compositions useful in the 
production of proteins by recombinant cell culture. The methods and compositions 
overcome intracellular retention of proteins and facilitate extracellular recovery of 
the produced proteins. In particular embodiments, the invention provides methods 

30 and compositions for the production of polypeptides other than tissue-type 

plasminogen activators (t-PA's) utilizing a precursor peptide corresponding to the 
amino terminal signal and/or pro peptides naturally associated with a mammalian t- 
PA which act to direct the secretion of the mature t-PA. In particular embodiments, 
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the invention provides for the export and secretion of polypeptides other than t-PA's. 
For example, the invention provides methods and compositions useful in the 
recombinant production and extracellular recovery of novel chimeric or fusion 
proteins such as immunoadhesins and especially those with poor secretion kinetics. 
5 In particular embodiments the invention provides for the production and recovery of 
glycoproteins that have been altered from their native sequence to add or remove 
one or more glycosylation sites. Advantageously, the invention improves and in 
preferred embodiments restores the export and secretion of such glycosylation site 
variants. It is a further advantage of the present invention that greater quantities of 

1 0 the polypeptides produced utilizing the compositions and methods of the present 
invention can be recovered from the extracelluar medium of the cultured cells from 
which they are produced. 

Accordingly, in particular embodiments, the invention provides nucleic 
acids, expression systems and host cells, as well as methods for the production of 

15 non t-PA polypeptides, such as immunoadhesins and especially glycosylation site 
variant polypeptides, in cell culture. In one embodiment of the present invention, a 
DNA construct is provided for the export and secretion of such polypeptides 
comprising a first DNA segment encoding a precursor peptide corresponding to all 
or a portion of the amino terminal signal and/or pro peptides naturally associated 

20 with a mammalian and preferably a human t-PA. The first DNA segment is 

operably linked, in frame, to a second DNA segment encoding the heterologous 
(non-t-PA) polypeptide. In particular aspects, the first DNA segment encodes a 
peptide corresponding to a mammalian and preferably a human t-PA pro-sequence 
as defined herein. According to this aspect of the present invention, the first DNA 

25 segment is operably linked to a second DNA segment encoding a heterologous 
polypeptide. 

Certain embodiments additionally comprise a DNA segment encoding a 
peptide corresponding to a mammalian signal and or pro-peptide upstream of and 
operably linked to the first DNA segment. For example, when the heterologous 
30 polypeptide is an immunoadhesin, such as a TNFR-IgG chimera, the first DNA 
segment may be preceded by a DNA segment encoding a mammalian t-PA or, 
alternatively, a heterologous signal and or pro-sequence. According to this aspect of 
the invention the mammalian t-PA signal sequence or other heterologous signal and 
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or pro-sequence and the pro-peptide amino acid sequence comprise the precursor 
peptide of the invention. 

In preferred embodiments, the first DNA segment encodes a native or 
naturally occurring signal and pro peptide of a mammalian t-PA and especially a 
5 human t-PA. The second DNA sequence preferably encodes a polypeptide other 
than a mammalian t-PA such as, for example, an immunoadhesin. According to 
further aspects of the invention, the second DNA sequence encodes a naturally 
occurring or chimeric polypeptide other than a mammalian t-PA wherein one or 
more glycosylation sites have been added to or deleted from the polypeptide's native 

10 sequence. The invention further provides a DNA construct comprising one or more 
additional DNA segments operably linked to the first and second DNA segments. 

The invention further provides a cultured eukaryotic host cell comprising a 
DNA construct having a first DNA segment encoding a precursor peptide the 
precursor peptide corresponding to all or a portion of a mammalian t-PA signal and 

1 5 or pro peptide and a second DNA segment operably linked to the first DNA 
sequence, the second DNA sequence encoding a polypeptide other than a 
mammalian t-PA and preferably a polypeptide other than a mammalian t-PA 
wherein one or more glycosylation sites have been added to or deleted from the 
polypeptide. 

20 The invention further provides methods of producing a heterologous 

polypeptide especially a polypeptide which has been altered to delete one or more 
native N-linked glycosylation sites comprising the steps of 

(a) culturing a eukaryotic host cell comprising a DNA construct, the DNA 
construct comprising: 

25 a first DNA segment encoding a precursor peptide corresponding to all or a 

portion of a mammalian tissue plasminogen activator signal and or pro peptide; and 

a second DNA segment operably linked to the first DNA sequence, the 
second DNA sequence encoding a polypeptide other than t-PA wherein one or more 
native N-linked glycosylation sites of the polypeptide have been deleted from the 

30 polypeptide wherein the eukaryotic host cell expresses the first and second DNA 
segments and the polypeptide is secreted from the cell; and 

(b) recovering the polypeptide so produced. 
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Brief Description of the Drawings 

Figure 1 . Diagram of a tumor necrosis factor immunoglobulin chimeric 
molecule (TNFR-IgGl) and signal sequences. TNFR-IgGl is a chimeric protein 
consisting of the extracellular domain of the p55 TNF receptor fused to the hinge 
5 and Fc domain of an immunoglobulin heavy chain. TNFR-IgGl is secreted as a 
homodimer with four N-linked glycosylation sites (squares) per monomer. The 
proteins expressed in this study were synthesized using the wild type TNFR signal 
sequence containing 29 amino acids (SEQ ID NO: 2), or a combination of the TNFR 
signal sequence and the signal and/or pro-sequence of human tissue plasminogen 

1 0 activator (tPA) (SEQ ID NO: 1 ). 

Figures 2A-2D. Effect of the signal sequence on the secretion kinetics of 
TNFR-IgGl and tPA.TNFR-IgGl. Parallel cultures of human embryonic kidney 
cells were transfected with plasmids encoding either TNFR-IgGl utilizing the native 
signal sequence or the signal and pro-sequence of t-PA (tPA.TNFR-IgGl) for pulse 

1 5 labeling experiments. Two days post-transfection the culture, medium was removed 
and replaced (after washing two times) with methionine- and cysteine-free DMEM 
supplemented with ( 35 S)-labeled methionine and cysteine. Cells were labeled at 
37°C (5% CO2) for the times indicated. The labeling reaction was terminated by 
washing the cells immediately in chilled (4°C) PBS followed by detergent 

20 extraction. Cell lysates and cell culture supernatants were precipitated by the 

addition of Protein A SEPHAROSE™. The Protein A:TNFR-IgGl complexes were 
pelleted by centrifugation, washed repeatedly and eluted in SDS-PAGE sample 
buffer containing mercaptoethanol. The eluted protein was resolved on 10% SDS- 
PAGE gels and visualized by autoradiography. Figure 2A, intracellular (i.e. cell 

25 associated) TNFR-IgGl; Figure 2B, Secreted TNFR-IgGl; Figure 2C> intracellular 
tPA.TNFR.IgG; Figure 2D, secreted tPA.TNFR.IgG. 

Figures 3A-3F. Secretion kinetics of TNFR-IgGl signal sequence and 
glycosylation mutants. Plasmids encoding fully glycosylated (Figure 3A) or 
mutated forms of TNFR-IgGl (Figures 3B-3F) were transfected into 293 cells and 

30 analyzed by pulse-chase analysis as described in Figure 2. Autoradiographs were 
analyzed with a scanning densitometer and the optical density values for 
supernatants (o) and cell lysates (■) were normalized and plotted as a function of 
time (0-24 hr). The glycosylation site mutants are identified using the single letter 



amino acid code with reference to the amino acid residue present at each of the four 
N-linked glycosylation sites (amino acid positions 14, 105, 111, and 248). Thus, the 
designation NNNN represents the wild type TNFR-IgGl, whereas NQNN indicates 
TNFR-IgGl where glutamine replaced asparagine at the second glycosylation site 
5 (position 105). Figure 3 A, TNFR-IgGl; Figure 3B, tPA.TNFR-IgGl; Figure 3C, 
TNFR-IgGl mutant lacking 2 native glycosylation sites (NNQQ); Figure 3D, 
tPA.NNQQ; Figure 3E, QSNQ; Figure 3F, tPA.QSNQ. 

Figures 4A-4C. Immunoprecipitation of TNFR-IgGl glycosylation mutants. 
Plasmids encoding either TNFR-IgGl or TNFR-IgG glycosylation site mutants were 

10 transfected into 293 cells using calcium phosphate precipitated DNA. Two days 
post-transfection the culture, medium was removed and replaced (after washing two 
times) with methionine- and cysteine-free DMEM supplemented with ( 35 S)-labeled 
methionine and cysteine. Cells were labeled overnight at 37°C (5% CO2). Cell 
lysates and cell culture supernatants were precipitated by the addition of Protein A 

1 5 SEPHAROSE™. The Protein A: TNFR-IgGl complexes were pelleted by 
centrifiigation, washed repeatedly and eluted in SDS-PAGE sample buffer 
containing mercaptoethanol. The eluted protein was resolved on 10% SDS-PAGE 
gels and visualized by autoradiography. The glycosylation site mutants were 
examined in four separate experiments (Figures 4A-4D) where transiently expressed 

20 TNFR-IgGl (TNFR-IgGl) expressed in 293 cells or a stable CHO cell line 

expressing TNFR-IgG (TRY+) were used as positive controls. Background binding 
was determined in experiments with cells transfected with thrombopoeitin (TPO-). 
The mobilities of molecular weight markers are indicated at the left margins. • 

Figures 5A and 5B. Effect of signal sequence replacement on the secretion 

25 of TNFR-IgGl glycosylation site mutants. Plasmids encoding TNFR-IgGl 

glycosylation site mutants containing the tPA signal/pro sequence (tPA) (SEQ ID 
NO: 1) or the TNFR signal sequence (TNFR) (SEQ ID NO: 2) were transfected into 
293 cells, metabolically labeled with ( 35 S) methionine and cysteine, and 
immunoprecipitated from cell culture supernatants (S) or cell lysates (L) by the 

30 addition of Protein A SEPHAROSE™ as described in Figure 4. The Protein 

A:TNFR-IgGl complexes were pelleted by centrifiigation, resolved by SDS-PAGE, 
and visualized by autoradiography. The mobilities of molecular weight markers are 
indicated at the left margins. 
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Figure 6. Characterization of TNFR-IgGl glycosylation mutants purified for 
receptor binding studies. TNFR-IgGl glycosylation site mutants were purified by 
protein A affinity chromatography from growth conditioned cell culture supernatants 
of stabile transfected CHO cells. The purified proteins were treated with SDS- 
5 PAGE sample buffer with- (lanes 8-12) or without (lanes 3-6) added 2- 

mercaptoethanol and resolved by SDS-PAGE. The resulting gel was stained with 
Coomassi blue. Each lane contained 3 ug of purified protein. Lanes 3 and 8 
contained fully glycosylated TNFR-IgGl; lanes 4 and 9 contained the NNQQ 
glycosylation site mutant; lanes 5 and 10 contained the NSNQ glycosylation site 

10 mutant; lanes 6 and 1 1 contained the QSNQ glycosylation site mutant. Lane 1 
contained molecular weight standards (indicated at left margin), lanes 2,7, and 12 
contained sample buffer alone. 

Figures 7A-7D. Competitive binding of ( 125 I) : labeled TNF to TNFR-IgGl 
glycosylation mutants. Purified TNFR-IgGl and TNFR-IgGl glycosylation site 

1 5 mutants were captured onto micotiter plates coated with affinity purified goat 

antibodies to human IgG Fc domain. The captured receptor chimeras were reacted 
with ( 125 I)-labeled TNF along with varying concentrations of unlabeled TNF. The 
binding of TNF to fully glycosylated TNFR-IgGl is shown in Figure 7 A; binding to 
the NNQQ glycosylation mutant is shown in Figure 7B; binding to the NSNQ 

20 glycosylation mutant is shown in Figure 7C; and binding to the QSNQ glycosylation 
mutant is shown in Figure 7D. 

Figure 8A-C. Secretion kinetics of TNFR-IgGl signal sequence variants. 
Plasmids encoding TNFRl-IgGl utilizing the signal sequence of TNFR1 (SEQ ID 
NO: 2)(Figure 8 A), the signal-pro sequence of tPA (SEQ ID NO: l)(Figure 8B) or 

25 the signal sequence of TNFR1 (SEQ ID NO: 2) and a pro-sequence (SEQ ID NO: 7) 
of human t-PA (SEQ ID NO: 8) (Figure 8C) were transfected into CHO cells and 
analyzed by pulse-chase analysis as described in Figure 2. Autoradiographs were 
analyzed with a scanning densitometer and the optical density values for 
supernatants (□) and cell lysates (O) were normalized and plotted as a function of 

30 time (0-24 hr). 
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Detailed Description of the Preferred Embodiments 

Definitions 

The terms "DNA sequence encoding", "DNA encoding" and "nucleic acid 
encoding" refer to the order or sequence of deoxyribonucleotides along a strand of 
5 deoxyribonucleic acid. The order of these deoxyribonucleotides determines the 
order of amino acids along the polypeptide chain. The DNA sequence thus codes 
for a particular amino acid sequence of a polypeptide. 

The terms "t-PA" and "tissue plasminogen activator" refer to a naturally 
occurring extrinsic (tissue-type) plasminogen activator having fibrinolytic activity 
10 and typically having a structure with five domains (finger, growth factor, kringle-1, 
kringle-2, and protease domains). Naturally occurring mammalian t-PA includes 
human species as well as other animal species such as rabbit, rat, porcine, non 
human primate, equine, murine, and ovine t-PA. Nucleic acids encoding t-PA from 
human and non-human species is known in the art. For example, human t-PA is 
15 encoded by the cDNA sequence reported in U.S. Patent Number 4,766,075, issued 
23 August 1988. 

A "precursor peptide" as used in the context of the present invention and as 
more fully described herein below, is used to refer to a polypeptide having an amino 
acid sequence corresponding to all or a portion of a naturally occurring mammalian 

20 t-PA signal and/or pro peptide which participates in the secretion of t-PA under 
native conditions. 

"Pre-pro" or "signal-pro" peptide as used in the context of the present 
. invention is meant to refer to an amino acid sequence such as that naturally 
associated with a mammalian t-PA which acts to direct the secretion of a mature 

25 polypeptide, for example, a mammalian t-PA, from a cell. As used herein, the term 
"signal-pro peptide" includes the "pre-" or "signal" sequence such as that naturally 
associated with a mammalian t-PA which functions to bind to the signal-recognition 
particle and direct the protein to the lumen of the endoplasmic reticulum (ER). A 
"signal" sequence is an amino acid sequence, characteristically hydrophobic in 

30 nature, cleaved by signal peptidases in the ER. For example, the signal sequence of 
t-PA is generally removed from the nascent t-PA co-translationally. In addition to a 
signal sequence, some mammalian proteins are associated post-translationally with a 
"pro" sequence such as the signal-pro sequence of a mammalian t-PA. The "pro-" 
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sequence serves to target the nacsent polypeptide, for example a mature t-PA, to the 
Golgi apparatus (GA) and is cleaved post-translationally. The pro-sequence is 
characteristically positively charged and commonly contains golgi peptidase 
cleavage sequences. 

5 The amino acid sequences of the mammalian t-PA signal-pro peptides are 

generally known or obtainable through conventional techniques. Signal-pro 
peptides from human and non-human t-PAs are known in the art and have been 
disclosed in, for example U.S. Patent 4,766,075; Rickles et ai, (1988) J. Biol. 
Chem. 263:1563-1560 and Feng et a.., (1990) J. Biol. Chem. 265:2022-2027. The 

10 term native signal-pro peptide specifically encompasses naturally-occurring pre- and 
prepro- sequences as defined and includes naturally-occurring variant forms {e.g., 
alternatively spliced forms) and naturally-occurring allelic variants of the t-PA 
signal-pro peptide. DNA sequences encoding mammalian t-PA amino terminal 
signal-pro peptides can be cloned as cDNA or genomic molecules according to 

1 5 techniques that are standard in the art or can be synthesized, preferably using 
automated equipment and the application of conventional synthetic protocols. 

As used herein, "heterologous polypeptide" refers generally to polypeptides 
and proteins regardless of their origin other than a t-PA or a variant thereof and 
generally having more than about ten amino acids which may or may not have one 

20 or more native or synthetic sites for the attachment of a carbohydrate. Preferably, 
heterologous polypeptides are "heterologous glycoproteins", that is a heterologous 
polypeptide as described above having one or more sites for the attachment of a 
carbohydrate in their native sequence. Examples of heterologous glycoproteins 
include molecules such as cytokines and their receptors, as well as chimeric proteins 

25 comprising cytokines or their receptors, including, for instance tumor necrosis factor 
alpha and beta, their receptors (TNFR-1; Gray et al.. 9 (1990) Proc. Natl. Acad. Sci. 
USA 87:7380-7384; and TNFR-2; Kohno et aL, (1990) Proc. Natl. Acad. Sci. USA 
87:833 1-8335) and their derivatives; renin; a growth hormone, including human 
growth hormone, and bovine growth hormone; growth hormone releasing factor; 

30 parathyroid hormone; thyroid stimulating hormone; lipoproteins; alpha- 1- 
antitrypsin; insulin A-chain; insulin B-chain; proinsulin; follicle stimulating 
hormone; calcitonin; luteinizing hormone; glucagon; clotting factors such as factor 
VIIIC, factor IX, tissue factor, and von Willebrands factor; anti-clotting factors such 



as Protein C; atrial natriuretic factor; lung surfactant; a plasminogen activator other 
than a tissue-type plasminogen activator (t-PA), for example a urokinase; bombesin; 
thrombin; hemopoietic growth factor; enkephalinase; RANTES (regulated on 
activation normally T-cell expressed and secreted); human macrophage 
5 inflammatory protein (MIP-1 -alpha); a serum albumin such as human serum 
albumin; mullerian-inhibiting substance; relaxin A-chain; relaxin B-chain; 
prorelaxin; mouse gonadotropin-associated peptide; a microbial protein, such as 
beta-lactamase; DNase; inhibin; activin; vascular endothelial growth factor (VEGF); 
receptors for hormones or growth factors; integrin; protein A or D; rheumatoid 

10 factors; a neurotrophic factor such as bone-derived neurotrophic factor (BDNF), 
neurotrophin-3, -4, -5, or -6 (NT-3, NT-4, NT-5, or NT-6), or a nerve growth factor 
, such as NGF-/3; platelet-derived growth factor (PDGF); fibroblast growth factor 
such as aFGF and bFGF; epidermal growth factor (EGF); transforming growth 
factor (TGF) such as TGF-alpha and TGF-beta, including TGF-/J1, TGF-02, TGF- 

15 03, TGF-/34, or TGF-|35; insulin-like growth factor-I and -II (IGF-I and IGF-II); 
des(l-3)-IGF-I (brain IGF-I), insulin-like growth factor binding proteins; CD 
proteins such as CD-3, CD-4, CD-8, and CD-I 9; erythropoietin; osteoinductive 
factors; immunotoxins; a bone morphogenetic protein (BMP); an interferon such as 
interferon-alpha, -beta, and -gamma; colony stimulating factors (CSFs), e.g., M- 

20 CSF, GM-CSF, and G-CSF; interleukins (ILs), e.g., IL-1 to IL-10; superoxide 

dismutase; T-cell receptors; surface membrane proteins; decay accelerating factor; 
viral antigen such as, for example, a portion of the HIV-1 envelope glycoprotein, 
gpl20, gpl60 or fragments thereof; transport proteins; homing receptors; addressins; 
regulatory proteins; antibodies and chimeric proteins, such as immunoadhesins. 

25 Preferably the heterologous glycoproteins are "glycosylation site variants" of 

any of the heterologous glycoproteins described above. More particularly, the 
glycosylation site variants according to the present invention are naturally occurring 
heterologous glycoproteins or variations thereof such as the chimeric proteins 
described above, having at least one site for glycosylation, preferably an N-linked 

30 glycosylation site, that has been added to or deleted from their native sequence. 
Such glycosylation site variants include both "glycosylation site addition variants" 
as well as "glycosylation site deletion variants" so named herein to delineate 
heterologous glycoproteins as defined above having one or more sites for 
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glycosylation, preferably N-linked, added to or deleted from their native sequence, 
respectively. 

The heterologous glycoproteins of the present invention have one or more 
carbohydrate structures that occur on the native protein expressed as N-linked or O- 
5 linked carbohydrates. The N-linked and O-linked carbohydrates differ primarily in 
their core structures. N-linked glycosylation refers to the attachment of the 
carbohydrate moiety via N-acetylglucosamine (GlcNAc) to an asparagine residue in 
the peptide chain. The amino acid sequence of the heterologous polypeptide will 
contain an asparagine-X-serine, asparagine-X-threonine, or asparagine-X-cysteine, 

10 wherein X is any amino acid except proline. O-linked carbohydrates, by contrast are 
characterized by a common core structure, which is the N-acetylgalactosamine 
(GalNAc) attached to the hydroxyl group of a threonine or serine in the native amino 
acid sequence of the heterologous polypeptide. 

The terms "tumor necrosis factor receptor" and "TNFR" in the context of the 

1 5 present invention refer to a polypeptide comprising the amino acid sequence of a 
native TNF binding polypeptide, or any combination, derivative or fragment thereof 
which is capable of binding to tumor necrosis factor-a and/or -j3 (TNF-a and/or -/?). 
This definition includes substantially intact cell surface or soluble type 1 and type 2 
TNFR polypeptides (TNFR1 and TNFR2) from natural sources, synthetically 

20 produced in vitro or obtained by genetic manipulation including methods of 
recombinant DNA technology, as well as various chain combinations of such 
polypeptides. Native TNFR include human and non-human animal, e.g. murine, 
bovine, equine, porcine, etc. TNFR polypeptides. The non-human mammalian 
species can, for example, be obtained by cross-species hybridization, using probes 

25 obtained from the human DNA sequence as hybridization probes to isolate TNFR 
cDNAs from the respective non-human mammalian cDNA libraries. 

The chimeric gene encoding TNFRl-IgGl is described in Ashkenazi et ai., 
(1991) Proc. Natl. Acad. Sci. 88:10535-10539. TNFRl-IgGl is a chimeric protein 
constructed by fusing the extracellular domain of the receptor type 1 for TNF alpha 

30 with sequences encoding the Fc domain and hinge region of IgGl (Ashkenazi et ai., 
(1991). The chimeric protein (Figure 1) contains four N-linked glycosylation sites: 
three located in the TNFR region (positions 14, 105, and 111) and one in the Fc 
domain (position 248). For simplicity, the glycosylation mutants described in this 
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invention are named using the single letter amino acid code with reference to the 
amino acid residue present at each of the four N-linked glycosylation sites at amino 
acid positions 14, 105, 111, and 248 (Figure 1). Thus, the designation NNNN 
represents the fully glycosylated TNFRl-IgGl, whereas NQNN indicates TNFR- 
5 IgGl where glutamine replaced asparagine at the second glycosylation site (position 
105). 

As used herein, the term "immunoadhesin" designates antibody-like 
molecules which combine the "binding domain" of a heterologous protein (an 
"adhesin"re.g. a receptor, ligand or enzyme) with the effector functions of 

10 immunoglobulin constant domains. Structurally, the immunoadhesins comprise a 
fusion of the adhesin amino acid sequence with the desired binding specificity which 
is other than the antigen recognition and binding site (antigen combining site) of an 
antibody (i.e. is "heterologous") and an immunoglobulin constant domain sequence. 
The immunoglobulin constant domain sequence in the immunoadhesin may be 

1 5 obtained from any immunoglobulin, such as IgGi, IgG2, IgG3, or IgG4 subtypes, 
IgA, IgE, IgD or IgM. Immunoadhesins are described in, for example, U.S. Patent 
Nos. 5,1 16,964, 5,714,147 and 5,336,603 the disclosures of which are hereby 
incorporated by reference. Immunoadhesins include CD4 (Capon et al, (1989) 
Nature 337:525-531; Traunecker et al, (1989) Nature 339:68-70; and Byrn et al, 

20 (1990) Nature 344:667-670); L-selectin or homing receptor (Watson et al, (1990) J. 
Cell. Biol. 1 10:2221-2229; and Watson et al, (1991) Nature 349:164-167); CD44 
(Aruffo et al, (1990) Cell 61:1303-1313; CD28 and B7 (Linsley et al, (1991) J. 
Exp. Med. 173:721-730); CTLA-4 (Lisley et al, J. Exp. Med. 174:561-569); CD22 
(Stamenkovic et al, Cell 66:1 133-1 144); TNF receptor (Ashkenazi et al, (1991) 

25 Proc. Natl. Acad. Sci. USA 88:10535-10539; Lesslauer et al, (1991) Eur. J. 

Immunol. 27:2883-2886; and Peppel etal, (1991) J. Exp. Med. 174:1483-1489; 
Mohler et al, (1993) J. Immunol. 151:1548-1561); NP receptors (Bennett s al, 

(1991) J. Biol. Chem. 266:23060-23067; interferon 7receptor (Kurschner et al, 

(1992) J. Biol. Chem. 267:9354-9360; 4-1BB (Chalupny etal., (1992) PNAS USA 
30 89:10360-10364) and IgE receptor a (Ridgway and Gorman, (1991) J. Cell. Biol. 

115, Abstract No. 1448). 

The terms "host cell" and "host cell line" refer to cells and cell lines derived 
from a prokaryotic or eukaryotic organism that are capable of growth and survival 
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when placed in either monolayer culture or in suspension culture in a medium 
containing the appropriate nutrients and growth factors. Suitable host cells for use 
within the present invention include any type of cell that can be engineered to 
express a DNA molecule, can be grown in culture, and have a secretory pathway. 
5 Although prokaryotic cells such as E. coli cells are capable of secreting protein at 
least into the periplasmic space, it is preferred within the context of the present 
invention to use cultured eukary otic cells, such as fungal cells, insect cells, yeast 
cells or in particular mammalian cells. The necessary growth factors for a particular 
cell line are readily determined empirically without undue experimentation, as 

10 described for example in Mammalian Cell Culture (Mather, LP. ed., Plenum Press, 
N.Y. [1984]), and Barnes and Sato, (1980) Cell, 22:649. Typically, the cells are 
capable of expressing and secreting large quantities of a particular glycoprotein of 
interest into the culture medium. Examples of such cells include SF9 insect cells 
(Summers and Smith (1987) Texas Agriculture Experiment Station Bulletin, 1555; 

1 5 and Insect Cell Culture Engineering, Goosen Daugulis and Faulkner Eds. Dekker, 
New York). Examples of suitable mammalian host cells within the context of the 
present invention may include Chinese hamster ovary cells/-DHFR (CHO, Urlaub 
and Chasin, Proc. Natl. Acad. Sci. USA, 77:4216 [1980]); dpl2.CHO cells (EP 
307,247 published 15 March 1989); monkey kidney CV1 line transformed by SV40 

20 (COS-7, ATCC CRL 165 1); human embryonic kidney line (293 or 293 cells 
subcloned for growth in suspension culture, Graham et aL 9 J. Gen Virol., 36:59 
[1977]); baby hamster kidney cells (BHK, ATCC CCL 10); mouse Sertoli cells 
(TM4, Mather, Biol. Reprod., 23:243-251 [1980]); monkey kidney cells (CV1 
ATCC CCL 70); African green monkey kidney cells (VERO-76, ATCC CRL- 

25 1 587); human cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney cells 
(MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 1442); 
human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); 
mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells (Mather et al., 
Annals N.Y. Acad. Sci., 383:44-68 [1982]); MRC 5 cells; FS4 cells; and a human 

30 hepatoma line (Hep G2). 

Preferred host cells include Chinese hamster ovary cells/-DHFR (CHO, 
Urlaub and Chasin, Proc. Natl. Acad. Sci. USA, 77:4216 [1980]); dpl2.CHO cells 
(EP 307,247 published 15 March 1989). 
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"Expression vector" refers to a DNA construct containing a DNA sequence 
which is operably linked to a suitable control sequence capable of effecting the 
expression of the DNA in a suitable host. Such control sequences include a 
promoter to effect transcription, an optional operator sequence to control such 
5 transcription, optional RNA splice donor/acceptor sequences, a sequence encoding 
suitable mRNA ribosome binding sites, and sequences which control termination of 
transcription and translation. The vector may be a plasmid, a phage particle, a 
recombinant virus or simply a potential genomic insert. Once transformed into a 
suitable host, the vector may replicate and function independently of the host 

10 genome, or may, in some instances, integrate into the genome itself. In the present 
specification, "plasmid" and "vector" are sometimes used interchangeably as the 
plasmid is the most commonly used form of vector at present. However, the 
invention is intended to include such other forms of expression vectors which serve 
equivalent functions and which are, or become, known in the art. 

1 5 "Operably linked" when describing the relationship between two DNA 

regions simply means that they are functionally related to each other. For example, 
a signal sequence is operably linked to a peptide if it functions to ensure synthesis on 
membrane bound ribosomes and to translocate the nascent protein across the lumen 
of the endoplasmic reticulum as a signal sequence, participating in the secretion of 

20 the mature form of the protein most probably involving cleavage of the signal 
sequence. A promoter is operably linked to a coding sequence if it controls the 
transcription of the sequence; a ribosome binding site is operably linked to a coding 
sequence if it is positioned so as to permit translation. Therefore, nucleic acid is 
"operably linked" when it is placed into a functional relationship with another 

25 nucleic acid sequence. Generally, "operably linked" means that the DNA sequences 
being linked are contiguous, and, in the case of a secretory leader, contiguous and in 
reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, 
the synthetic oligonucleotide adaptors or linkers are used in accordance with 

30 conventional practice. 

The terms "transfected," "transformed (host) cell", "transfectant" and 
"transformed" and the like refer to the introduction of DNA into a cell. The cell is 
termed a "host cell". The introduced DNA is usually in the form of a plasmid 
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derived vector containing an inserted piece of DNA. The introduced DNA sequence 
may be from the same species as the host cell or a different species from the host 
cell, or it may be a hybrid DNA sequence, containing some foreign and some 
homologous DNA. The words transformants and transformed (host) cells include 
5 the primary subject cell and cultures derived therefrom, without regard to the 
number of transfers. It is also understood that all progeny may not be precisely 
identical in DNA content, due to deliberate or inadvertent mutations or amplification 
of chromosomal fragments. Mutant progeny that have the same function or 
biological property as screened for in the originally transformed cell are included. 

10 The term amino acid or amino acid residue, as used herein, refers to naturally 

occurring L amino acids or residues, unless otherwise specifically indicated. The 
commonly used one- and three-letter abbreviations for amino acids are use herein 
(Lehninger, A. L., Biochemistry, 2d ed., pp. 71-92, Worth Publishers, N. Y. (1975)). 
Modes for Carrying out the Invention 

1 5 Studies have demonstrated that deletion of N-linked glycosylation sites from 

a glycoprotein's primary structure often impairs the intracellular transport and 
secretion of the polypeptide. According to the present invention, the defect in 
secretion of such polypeptides can be overcome by operably linking the nucleic acid 
sequence encoding the polypeptide to a nucleic acid sequence encoding a precursor 

20 peptide. Surprisingly, the precursor polypeptide increased the secretion efficiency 
of the parent glycoprotein as well. The precursor peptide has an amino acid 
sequence which correspond to all or a portion of a naturally occurring mammalian t- 
PA signal-pro peptide which participates in the secretion of t-PA under native 
conditions. 

25 Preferably the precursor peptide of the present invention is "capable of 

restoring the export and secretion" of a heterologous polypeptide, especially a 
heterologous glycoprotein such as an immunoadhesin and more preferably a 
heterologous glycosylation site variant glycoprotein to the extracellular space. As 
will be appreciated by the skilled artisan "capable of restoring the export and 

30 secretion" of a polypeptide to the extracellular space is a relative term. Thus the 
term when used to describe the biological activity of the precursor peptide means a 
peptide that when linked to a heterologous polypeptide produces an increase in the 
amount of heterologous polypeptide that can be recovered from the extracellular 
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space when compared to the amount recoverable from the extracellular space in the 
absence of the precursor peptide of the present invention and in the presence of the 
native signal and/or pro sequence of the heterologous polypeptide. 

To determine whether the precursor peptide improves the export and 
5 secretion of the polypeptide, the efficiency of intracellular transport can be measured 
by routine experimentation. For example, a plasmid can be constructed for transient 
transfection studies in eukaryotic host cells. Pulse-chase experiments similar to 
those described herein will show that replacement of the native signal sequence 
associated with the heterologous polypeptide with the precursor peptide of the 

10 present invention results in an improvement in secretion efficiency. Thus, after 
transient expression at, for example 24 hours, the amount of heterologous 
polypeptide secreted into the culture medium can be measured and compared to a 
control using a native or other signal sequence. For example, approximately 70- 
80% and preferably 80-100% of the pulse-labeled heterologous polypeptide can be 

1 5 secreted into the cell culture medium when a precursor peptide of the present 
invention is employed in the expression construct, whereas only 50-60% of the 
heterologous polypeptide is secreted using the wild-type or native signal sequence. 
This represents a 20 to 50% increase in secretion efficiency. 

The increased efficiency of secretion achieved according to the present 

20 invention utilizing the precursor peptide is also apparent from the kinetics of 
secretion of the heterologous polypeptide. For example, when the native signal 
sequence is employed, 40% of the pulse-labeled heterologous polypeptide is 
secreted in 2-4 hrs. after pulse labeling. However when the native signal sequence is 
replaced with the precursor peptide corresponding to a mammalian t-PA signal-pro 

25 peptide, approximately 60% of the heterologous polypeptide is secreted in this time 
frame. 

The precursor peptide of the present invention has an amino acid sequence 
which corresponds to all or a portion of the amino acid sequence of a mammalian t- 
PA signal-pro peptide including native signal-pro peptides of t-PA having the same 
30 amino acid sequence as a signal-pro peptide of a mammalian t-PA sequence derived 
from nature. Using human t-PA as but an example, the signal-pro peptide precedes 
the mature t-PA protein sequence and acts to direct export and secretion of the 
mature protein. The approximately 35 amino acid signal pro-peptide of human t-PA 
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consists of a signal sequence of about 21 (SEQ ID NO: 3) or 22 (SEQ ID NO: 12) 
amino acid residues, also called a pre-sequence, and an 1 1 (SEQ ID NO: 4) or 10 
(SEQ ID NO: 13) amino acid pro-sequence, including a 3 amino acid exopeptidase 
cleavage site (see, for example, Figure 1 infra ; Example 8, infra ; and Berg and 
5 Grinnell (1991) Biochem. Biophys. Res. Commun. 179:31289-1296). However, it is 
the ability of the precursor peptide to restore or increase export and secretion of the 
heterologous polypeptide and not the length of the precursor peptide that is 
important. Therefore, in one embodiment of the invention, the signal-pro peptide of 
a mammalian t-PA is a native human sequence comprising amino acids 1 to 35 of 
10 SEQ ID NO: 1 numbered as shown below and corresponding to amino acids -35 to - 
1 of Fig. 1 . Therefore, in general the present invention makes use of a precursor 
peptide having the sequence: 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 
Met-Asp-Ala-Met-Lys-Arg-Gly-Leu-Cys-Cys-Val-Leu-Leu-Leu-Cys-Gly- 
15 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 
Ala-Val-Phe-Val-Ser-Pro-Ser-Gln-Glu-Ile-His-Ala-Arg-Phe-Arg-Arg- 
33 34 35 

Gly-Ala-Arg (SEQ ID NO: 1). 

In another embodiment of the invention, the precursor peptide comprises a 

20 human t-PA pro-sequence as, for example, amino acids 22 to 32 (SEQ ID NO: 4) of 
SEQ ID NO: 1 , or amino acids 22-35 (SEQ ID NO: 5) of SEQ ID NO: 1 and more 
preferably amino acid residues 23-35 of SEQ ID NO: 1 (SEQ ID NO: 7). In another 
embodiment of the invention, the precursor peptide is a human signal-pro sequence 
comprising amino acids 1 to 32 (SEQ ID NO: 6) of SEQ ID NO: 1 and preferably 

25 amino acid residues 1-35 (SEQ ID NO: 1). Such precursor peptides and the DNA 
encoding them can be isolated from nature or can be produced by recombinant or 
synthetic means. 

- In a further aspect of the present invention the precursor peptides of the 
present invention are homologous amino acid sequences of mammalian t-PA's 
30 signal-pro peptides or homologous amino acid sequences of the sequence of, for 
example, SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO: 6, and SEQ ID NO: 7 including homologous in vitro generated variants having 
the qualitative biological activity defined above. Homology with respect to the 
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precursor peptides of the present invention is defined as the percentage of amino 
acid residues in a candidate sequence that are identical with either the amino acid 
residues in SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID 
NO: 6, SEQ ID NO: 7, the amino acid sequence of a mammalian t-PA or a 
5 composite sequence of a human and other mammalian sequence as defined herein 
after aligning the sequences and introducing gaps if necessary to achieve the 
maximum identity. No N- or C- terminal extensions or deletions in the candidate 
sequence shall be construed as reducing identity. For example, N- or C -terminal 
addition of a signal sequence other than that of a mammalian t-PA to mammalian 

10 precursor peptide as defined herein is within the scope of the precursor peptide of 
the invention. "Composite amino acid" within the present invention refers to an 
alternate amino acid having the same position in the 35 amino acid residue structure 
as human t-PA from another mammalian vertebrate species. Therefore, an amino 
acid substitution referred to as a composite amino acid substitution replaces the 

1 5 identified amino acid with the equivalent or composite amino acid from another 
mammalian species. A composite sequence is defined as having at least one amino 
acid from the native human t-PA sequence replaced with a composite amino acid 
from another mammalian species. 

Therefore, the invention contemplates an precursor peptide having at least ; 

20 the qualitative biological activity as defined above and having, for example, at least 
about 75% amino acid homology with the polypeptide of SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 4 or the polypeptide of SEQ ID NO: 1 lacking the 3 carboxyl 
terminal amino acid residues, SEQ ID NO: 6 and/or the 21 amino terminal residues, 
SEQ ID NO: 5. The precursor pep tide t amino acid sequence preferably will share at 

25 least 80%, more preferably, greater than 85% sequence homology with the sequence 
of SEQ ID NO: 1 or the sequences lacking the 3 carboxyl terminal amino acid 
residues (SEQ ID NO: 6) and/or the 21 amino terminal residues (SEQ ID NO: 5). 
However, a precursor peptide may exhibit less than 50% sequence homology with 
the sequence of SEQ ID NO: 1 and still retain the characteristics of the precursor 

30 peptide as defined above. 

Since the mechanism by which the signal-pro peptide of t-PA overcomes 
endoplasmic reticulum retention may relate to its complex structure, precursor 
peptides which comprise a portion of a signal-pro peptide as defined above are also 
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within the scope of the present invention. For example, as noted above, the signal- 
pro peptide of t-PA is unusual, and contains an approximately 21-22 amino acid pre- 
or signal sequence and a 10 or 1 1 amino acid pro-sequence, including a 3 amino acid 
exopeptidase cleavage site (Figure 1). Previous studies, (Berg and Grinnell (1991) 
5 Biochem. Biophys. Res. Commun. 179: 1289-1296) have shown that the pre- or 
signal sequence is cleaved co-translationally in the endoplasmic reticulum and that 
the pro-sequence is removed in the Golgi apparatus (GA) by cleavage at a fiirin 
processing site. The fact that the endo H sensitive, intracellular forms of an 
exemplary heterologous glycoprotein glycosylation site variant expressed with the 

1 0 amino terminal precursor peptide sequence of the present invention possessed higher 
molecular weight than the secreted products demonstrates that, like wild type t-PA, 
removal of the pro-sequence from glycosylation mutants occurs during export 
through the GA. Based on these observations, the amino terminal pro-sequence of t- 
PA represents a structural element able to promote ER to GA transport of the 

1 5 heterologous polypeptide, and especially glycosylation mutants. 

Therefore according to a preferred aspect of the present invention, a 
precursor peptide comprising all or a portion of the pro-sequence of a mammalian- 
tPA is employed to effect the export and secretion of the heterologous polypeptides 
of the present invention. For example, the pre-sequence of any mammalian pre- 

20 protein sequence, for example, that associated with the native heterologous 
polypeptide, may be employed in conjunction with the pro-sequence of a 
mammalian t-PA in the precursor peptide of the present invention. For example, 
insertion of the 13 amino acid tPA pro-peptide Ser-Gln-Glu-Ile-His-Ala-Arg-Phe- 
Arg-Arg-Gly-Ala-Arg (SEQ ID NO: 7) between the TNFRl-IgGl endogenous 

25 signal sequence, i.e., that naturally associated with TNFR1 (Gray et al., (1990) 
supra , amino acids -40 to -12 of Figure 1, SEQ ID NO: 2) and the TNFRi-IgGl 
coding sequence provides a precursor peptide (SEQ ID NO: 8) within the context of 
the present invention that exhibits an increase in export and secretion of the mature 
molecule (TNFRl-IgGl). 

30 According to another aspect of the present invention the DNA sequence of 

the t-PA pro-sequence (SEQ ID NO: 5; SEQ ID NO: 7) is modified to introduce or 
eliminate proteolytic processing sites. Such modified precursor peptides are 
described in, for example, International publication No. WO 96/17067. Briefly, and 
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while not wishing to be bound by theory it is believed that pro-sequence of 
mammalian t-PA is cleaved at a site dependent upon a prohormone converting 
enzyme such as the yeast KEX2 gene product or the mammalian enzymes PCI , PC2 
and furin. Enzymes of this type recognize cleavage sites characterized by arginine 
5 residues in the -1 and -4 positions. Cleavage is facilitated by a basic amino acid 
residue e.g. Lys or Arg in the -1 position. Within the present invention therefore are 
a pro-sequence having an Arg at residue 35 and 32 and optionally a basic amino acid 
residue at position 34. 

As indicated, the present invention further comprises a second DNA segment 

10 encoding a heterologous polypeptide, i.e., a polypeptide other than a mammalian t- 
PA such as a TNFR immunoadhesin. Preferably the heterologous polypeptide lacks 
or has been modified to delete one or more native glycosylation sites as described. 

Deletion of glycosylation sites from a heterologous polypeptide may be 
accomplished by altering the amino acid sequence of the polypeptide. The alteration 

1 5 may be made by, for example, the addition of or substitution by one or more amino 
acids to the polypeptide's native sequence. For example, if N-linked glycosylation 
is contemplated, the glycosylation site in the variant is a tripeptidyl sequence of the 
formula: asparagine-X-serine or asparagine-X-threonine, wherein asparagine is the 
acceptor and X is any of the twenty genetically encoded amino acids except proline. 

20 (See Struck, D.K. and Lennarz, W.J. in The Biochemistry of Glycoproteins and 
Proteoglycans , WJ. Lennarz ed., Plenum Press, 1980, p. 35; Marshall, R.D. (1974) 
Biochem. Soc. Symp. 40:17; and Winzler, RJ. in Hormonal Proteins and Peptides , 
Li, C.I. ed., Academic Press, New York, 1973, pp. 1-15). The amino acid sequence 
variants herein are modified by substituting the appropriate amino acid(s) at the 

25 proper site(s) to effect a change in the native glycosylation site. For example an 
asparagine residue can be replaced by a glutamine residue to disrupt a native 
glycosylation site within the polypeptide. 

If O-linked glycosylation is to be employed, O-glycosidic linkage occurs in 
animal cells between N-acetylgalactosamine, galactose or xylose, and one of several 

30 hydroxyamino acids, most commonly serine or threonine, but also in some cases a 5- 
hydroxyproline or 5-hydroxylysine residue placed in the appropriate region of the 
molecule. 
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According to the present invention the polypeptides are altered through 
changes at the DNA level, particularly by mutating the DNA encoding the 
polypeptide at preselected bases, i.e., those encoding N- or O-linked glycosylation 
sites such that codons are generated that will translate into amino acids other than 
5 those that will typically be present for N- or O-linked glycosylation. The variations 
can be made using methods known in the art such as oligonucleotide-mediated (site 
directed) mutagenesis, alanine scanning and PCR mutagenesis. Site directed 
mutagenesis (Carter et aL, (1986) Nucl. Acids Res. 13:433 1; Soller et aL., (1987) 
Nucl. Acd. Res., 10:6487, cassette mutagenesis (Wells et aL, (1985) Gene 34:315), 

10 restriction selection mutagenesis (Wells et al, (1986) Philos. Trans. R. Soc. 
London) or other known techniques can be performed on the cloned DNA to 
produce the glycosylation site mutants that are a subject of the present invention. 

Therefore, according to the present invention in order to direct the secretion 
of a heterologous polypeptide and preferably a glycosylation variant as described 

1 5 above, a DNA segment encoding a precursor peptide is joined to a DNA sequence 
encoding the heterologous polypeptide in the correct reading frame so that the joined 
sequences encode a fusion protein. 

Expression vectors for use in the methods and compositions of the present 
invention will generally include a promoter capable of directing the transcription of 

20 the first and second DNA of the invention. The term "control sequences" refers to 
DNA sequences necessary for the expression of an operably linked coding sequence 
in a particular host organism. The control sequences that are suitable for 
prokaryotes, for example, include a promoter, optionally an operator sequence, and a 
ribosome binding site. Eukaryotic cells are known to utilize promoters, 

25 polyadenylation signals, and enhancers. Preferred promoters include viral promoters 
and cellular promoters. 

Host cells are transfected or transformed with expression or cloning vectors 
described herein for heterologous polypeptide production and cultured in 
conventional nutrient media modified as appropriate for inducing promoters, 

30 selecting transformants, or amplifying the genes encoding the desired sequences. 
The culture conditions, such as media, temperature, pH and the like, can be selected 
by the skilled artisan without undue experimentation. In general, principles, 
protocols, and practical techniques for maximizing the productivity of cell cultures 
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can be found in Mammalian Cell Biotechnology: a Practical Approach . M. Butler, 
ed. (IRL Press, 1991) and Sambrook et al.., supra . 

Methods of transfection are known to the ordinarily skilled artisan, for 
example, CaP04 and electroporation, as well as the use of commercially available 
5 cationic lipid reagents which facilitate transfection. Depending on the host cell 
used, transformation is performed using standard techniques appropriate to such 
cells. The calcium treatment employing calcium chloride, as described in Sambrook 
et al, supra , or electroporation is generally used for prokaryotes or other cells that 
contain substantial cell-wall barriers. Infection with Agrobacterium tumefaciens is 

10 used for transformation of certain plant cells, as described by Shaw et al., Gene . 
. 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells 
without such cell walls, the calcium phosphate precipitation method of Graham and 
van der Eb, Virology, 52:456-457 (1978) can be employed. General aspects of 
mammalian cell host system transformations have been described in U.S. Patent No. 

1 5 4,399,2 16. Transformations into yeast are typically carried out according to the 
method of Van Solingen et al., J. Bact. 130:946 (1977) and Hsiao etaL, Proc. 
Natl. Acad. Sci. (TJSAl 76:3829 (1979). However, other methods for introducing 
DNA into cells, such as by nuclear microinjection, electroporation, bacterial 
protoplast fusion with intact cells, or polycations, e.g., polybrene, polyornithine, 

20 may also be used. For various techniques for transforming mammalian cells, see 
Keown et al.., Methods in Enzymology , 185 :527-537 (1990) and Mansour et al.., 
Nature , 336:348-352 (1988). 

Suitable host cells for the expression of heterologous polypeptide that retain 
one or more glycosylation sites are derived from multicellular organisms. Examples 

25 of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, 
as well as plant cells. Examples of useful mammalian host cell lines include 
Chinese hamster ovary (CHO) and COS cells. More specific examples include 
monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human 
embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, 

30 Graham et al.., J. Gen Virol. . 36:59 (1977)); Chinese hamster ovary cells/-DHFR 
(CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. USA. 77:4216 (1980)); mouse 
Sertoli cells (TM4, Mather, Biol. Reprod.. 23:243-251 (1980)); human lung cells 
(W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse 
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mammary tumor (MMT 060562, ATCC CCL51). The selection of the appropriate 
host cell is deemed to be within the skill in the art. 

Host cells containing DNA constructs of the present invention are then 
cultured to produce the heterologous protein. The cells are cultured according to 
5 standard methods in a culture medium containing the necessary nutrients for growth 
of cultured mammalian cells. 

For the culture of the eukaryotic cells expressing the desired protein and 
modified as described for the instant invention, numerous culture conditions can be 
used paying particular attention to the host cell being cultured. Suitable culture 

10 conditions for eukaryotic cells are well known in the art (J. Immunol. Methods 
(1983)56:221-234) or can be easily determined by the skilled artisan (see, for 
example, Animal Cell Culture: A Practical Approach 2nd Ed., Rickwood, D. and 
Hames, B.D., eds. Oxford University Press, New York (1992)), and vary according 
to the particular host cell selected. 

1 5 The eukaryotic cell culture of the present invention is prepared in a medium 

suitable for the particular cell being cultured. Commercially available media such as 
Ham's F10 (Sigma), Minimal Essential Medium ([MEM], Sigma), RPMI-1640 
(Sigma), and Dulbecco's Modified Eagle's Medium ([DMEM], Sigma) are 
exemplary nutrient solutions. In addition, any of the media described in Ham and 

20 Wallace,(1979) Meth. Enz., 58:44; Barnes and Sato,(1980) Anal. Biochem., 

102:255; U.S. Patent Nos. 4,767,704; 4,657,866; 4,927,762; 5,122,469 or 4,560,655; 
International Publication Nos. WO 90/03430; and WO 87/00195; the disclosures of 
all of which are incorporated herein by reference, may be used as culture media. 
Any of these media may be supplemented as necessary with hormones and/or other 

25 growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such 
as sodium chloride, calcium, magnesium, and phosphate), buffers (such as HEPES), 
nucleosides (such as adenosine and thymidine), antibiotics (such as Gentamycin™ 
drug), trace elements (defined as inorganic compounds usually present at final 
concentrations in the micromolar range) lipids (such as linoleic or other fatty acids) 

30 and their suitable carriers, and glucose or an equivalent energy source. Any other 
necessary supplements may also be included at appropriate concentrations that 
would be known to those skilled in the art. 
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In a particular embodiment, the mammalian host cell is a CHO cell, 
preferably a DHFR-CHO cell and a suitable medium contains a basal medium 
component such as a DMEM/HAM F-12 based formulation (for composition of 
DMEM and HAM F12 media, see culture media formulations in American Type 
5 Culture Collection Catalogue of Cell Lines and Hybridomas,, Sixth Edition, 1988, 
pages 346-349) (the formulation of medium as described in U.S. Patent 5,122,469 
are particularly appropriate) with modified concentrations of some components such 
as amino acids, salts, sugar, and vitamins, and optionally containing glycine, 
hypoxanthine, and thymidine; recombinant human insulin, hydrolyzed peptone, such 

10 as Primatone HS or Primatone RL (Sheffield, England), or the equivalent; a cell 
protective agent, such as Pluronic F68 or the equivalent pluronic polyol; 
Gentamycin; and trace elements. 

For the production of the sought after glycoproteins, production by growing 
the host cells of the present invention under a variety of cell culture conditions is 

15 typical. For instance, cell culture procedures for the large or small scale production 
of proteins are potentially useful within the context of the present invention. 
Procedures including, but not limited to, a fluidized bed bioreactor, hollow fiber 
bioreactor, roller bottle culture, or stirred tank bioreactor system may be used, in the 
later two systems, with or without microcarriers, and operated alternatively in a 

20 batch, fed-batch, or continuous mode. 

Following the polypeptide production phase, the polypeptide of interest is 
recovered from the culture medium using techniques which are well established in 
the art. 

The polypeptide of interest preferably is recovered from the culture medium 
25 as a secreted polypeptide. As a first step, the culture medium or lysate is centrifuged 
to remove particulate cell debris. The polypeptide thereafter is purified from 
contaminant soluble proteins and polypeptides, with the following procedures being 
exemplary of suitable purification procedures: by fractionation on immunoaffinity 
or ion-exchange columns; ethanol precipitation; reverse phase HPLC; 
30 chromatography on silica or on a cation-exchange resin such as DEAE; 

chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, 
for example, Sephadex G-75; and protein A SEPHAROSE™ columns to remove 
contaminants such as IgG. A protease inhibitor such as phenyl methyl sulfonyl 



-24- 



fluoride (PMSF) also may be useful to inhibit proteolytic degradation during 
purification. 

The following examples are offered by way of illustration and not by way of 
limitation. The disclosures of all citations in the specification are expressly 
5 incorporated herein by reference. 
EXAMPLES 
Introduction 

The following are general methods used in the Examples that follow. 
10 Methods 

DNA and Constructions. The chimeric gene encoding TNFR-IgGl is 
described (Ashkenazi et aL, (1991) Proc. Natl. Acad. Sci. 88:10535-10539).Site 
directed mutagenesis was carried out following the mismatched primer method of 
Kunkel, et aL (1985) Proc. Natl. Acad. Sci. 82:488-492) using the Muta-Gene M13 

1 5 Kit (Bio-Rad Inc., Hercules, CA). Mutations were verified by dye terminator cycle 
sequencing using the automated ABI 310 Genetic Analyzer (Applied Biosy stems, 
Foster City, CA). For simplicity, the glycosylation mutants described in this paper 
are named using the single letter amino acid code with reference to the amino acid 
residue present at each of the four N-linked glycosylation sites at amino acid 

20 positions 14, 105, 1 1 1, and 248 (Figure 1). Thus, the designation NNNN represents 
the fully glycosylated TNFR-IgGl, whereas NQNN indicates TNFR-IgGl where 
glutamine replaced asparagine at the second glycosylation site (position 105). 
TNFR-IgGl glycosylation variants were cloned into the expression vector pRK5 
(Eaton et aL, (1986) Biochem. 291:8343-8347), that enabled transient expression in 

25 the human embiyonic kidney 293 cell line. The pRK5 chimeric transcription unit 
includes a cytomegalovirus (CMV) immediate early promoter and a simian virus 40 
(SV-40) polyadenylation site. For expression in CHO cells, gene encoding TNFR- 
IgGl variants were cloned into the closely related expression vector pSVI6B5 that 
differs from pRK5 in that transcription is directed by SV-40 promoter and enhancer 

30 elements rather than by the CMV promoter. 

Cells and Transfections. 293 cells were cultured in 100 mm culture plates 
containing a mixture of Dulbecco's modified Eagles medium (DMEM) and Ham's 
F12 medium (F12) (Gibco BRL, Grand Island, NY) supplemented with 10% whole 
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fetal bovine serum (FBS) and incubated at 37°C in an atmosphere containing 5% 
CO2. Confluent plates of 293 cells were passaged at a ratio of 1 :5; and were 
transfected at 60% confluency with plasmids containing TNFR-IgGl glycosylation 
variants by the calcium phosphate method (Graham and Van Der Eb (1973) Virol. 
5 52:456-467). For expression of TNFR-IgGl variants in CHO cell lines, the genes 
encoding TNFR-IgGl glycosylation site mutants were cloned into the pSVI6B5 
expression vector. These plasmids were then co-transfected into CHO cells 
deficient in the production of dihydrofolate reductase (dhfr) along with a plasmid 
(pFDl 1) containing a cDNA encoding murine dhfr. The transfected cells were 

10 selected for the ability to grow in culture medium deficient in the production of 
glycine, hypoxanthine and thymidine. Resulting colonies were picked and then 
selected for growth in varying concentration of methotrexate. Stable cell lines 
expressing TNFR-IgGl glycosylation site mutants were grown in 10 liter mini- 
fermentors, and growth conditioned cell culture medium was harvested for affinity 

15 purification (described below). 

Metabolic Labeling and Immunoprecipitation. Two days post-transfection, 
the culture medium was removed from the transfected 293 cells and the cell 
monolayer washed twice with phosphate buffered saline (PBS). Cells were 
incubated in methionine and cysteine free DMEM and supplemented with ProMix 

20 35S-Cell Labelling Mix (Amersham, Arlington Hts, IL) (100(Ci/mL). The cells 
were labeled 8-16 hours at 37 °C in an atmosphere of 5% C02. After labeling, the 
cell supernatants were removed, centrifuged at 4000 rpm for 5 minutes, and 1 mL 
was aliquoted for immunoprecipitation experiments. The labeled cells were washed 
three times with PBS, lysed directly on the culture dishes with cell lysis buffer (PBS 

25 containing 3% NP-40), and centrifuged at 14,000 x g for 5 minutes. The lysate was 
transferred to a fresh tube, and 200 (iL was removed for immunoprecipitation. 
Immunoprecipitation of both supernatants and lysates was accomplished by addition 
of 30 jxL of S. aureus Protein A (Pharmacia Inc., Piscataway, NJ). After a short 
incubation period on ice (10-15 minutes) the Protein A/TNFR-IgGl complexes were 

30 sedimented by centrifugation at 14,000 x g for 1 minute, washed in wash buffer' 
(PBS, 1% NP-40, 0.1% sodium dodecyl sulfate (SDS)), and resuspended in 2X 
sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) sample 
buffer containing 5% 2-mercaptoethanol (2-ME). The immunoprecipitated proteins 
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were resolved by SDS-PAGE using 10% Tris-Glycine polyacrylamide gels (Novex, 
Inc., San Diego, CA). Proteins were visualized by autoradiography and mobilities 
were calculated with reference to 14 C-Methylated Rainbow Colored Protein 
Molecular Weight Markers (Amersham, Arlington Heights, IL). Proteins were 
5 quantitated by densitometric scanning of autoradiographs using the Personal 
Densitometer (Molecular Dynamics, Sunnyvale, CA). 

Pulse Chase Analysis. 48 hours post-transfection, cells were pulse-labeled 
with ProMix 35 S-Cell Labeling Mix (500(Ci /mL) for 15 minutes. The labeling 
medium was then removed and the cell monolayer was washed at room temperature 

10 with PBS. Fresh medium (DMEM/F12 + 10% FBS) was then added, and cells were 
harvested at various time-points. For harvesting, plates were chilled on ice; 
supernatants were removed, and the cell monolayer was washed once with PBS. 
Cells were then lysed with lmL of lysis buffer. Lysate and supernatant samples 
were aliquoted and immunoprecipitated as described above. 

1 5 Purification of TNFR-IgG glycosylation site mutants. Growth conditioned 

culture medium from cultures at 10 L CHO cells expressing TNFR-IgG 1 
glycosylation site mutations were purified by affinity chromatography on 
immobilized S. aureus protein A using a method described previously (Chamow et 
aL (1994) J. Immunol. 153:4268-4280). Four TNFR-IgGl variants were purified: 

20 NNNN, NNQQ, NSNQ, and QSNQ. Briefly, purification was involved the 

following steps. (I) Prior to sample loading, the Protein A column was equilibrated 
with 20 mM Tris buffer, 1 5 mM NaCl (pH 7.4). After the growth conditioned cell 
culture medium was loaded onto the column, the column was sequentially washed 
with the following buffers: 20 mM Tris buffer, 150 mM NaCl (pH 7.4); 20 mM Tris 

25 buffer, 500 mM TMAC (pH 7.4); and 20 mM Tris buffer (pH 7.4). The 

glycosylation site mutants were eluted with 50 mM citric acid, 20% (w/v) glycerol 
(pH 3.0) and the elution pools were subsequently adjusted to pH 6.0 using 1.0 M 
sodium citrate. Finally, the eluates were buffer exchanged by gel filtration 
chromatography into PBS (pH 7.4). The purified mutants were analyzed using SDS- 

30 PAGE (12.5%) with Coomassie blue staining. 

TNF binding assay. The binding of TNF to TNFR-IgGl and TNFR-IgGl 
glycosylation site mutants was determined using a competitive binding 
immunoassay similar to that described by Ashkenazi, et aL. (1991) supra . Briefly, 
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microtiter plates were coated with affinity purified goat antibodies to human IgG Fc 
domain. Purified TNFR-IgGl or TNFR-IgGl glycosylation site mutants were 
captured on the plates and then reacted with varying concentrations of unlabeled 
TNF (0.8 to 800 nM) and a fixed amount of ( I25 I)-labeled TNF (0.05 nM). After 
5 washing the amount of TNF remaining bound to the wells of the microtiter plates 
was determined with a gamma counter. Data were fit using a four parameter curve. 
The effective concentration of unlabeled TNF that resulted in half maximal (50%) 
binding of ( 125 I)-labeled TNF to TNFR-IgGl glycosylation mutants was reported as 
EC50 values. 

10 EXAMPLE 1 

TNFR-IgGl is a chimeric protein constructed by fusing the extracellular 
domain of the receptor for TNF alpha with sequences encoding the Fc domain and 
hinge region of IgG 1 (Ashkenazi et a/.., (1991) supra ). The chimeric protein (Figure 
1) contains four N-linked glycosylation sites: three located in the TNFR region 

1 5 (positions 14, 105, and 111) and one in the Fc domain (position 248). Previous 
studies have shown that TNFR-IgGl is secreted as a homodimer, binds to TNF 
alpha with high affinity, and has potent anti-inflammatory activity in vivo. 

Pulse-chase experiments were carried out to characterize the secretion 
efficiency of TNFR-IgGl. In these studies, 293 cells were transiently transfected 

20 with a calcium phosphate precipitated plasmid (pRK. TNFR-IgGl) and cultured for 2 
days. The cells were pulse-labeled with ( 35 S)- methionine, and samples were 
collected at various time points. Immunoprecipitation studies (Figure 2) indicated 
that TNFR-IgGl first appeared in the cell culture medium approximately 1 hr after 
pulse labeling, ahd that the amount of secreted protein progressively increased over 

25 the entire 24 hr observation period. These studies also showed that a significant 
fraction of the TNFR-IgGl synthesized during a 15 minute pulse labeling was 
retained inside the cells. Densitometric analysis of these data (Figure 3A) 
demonstrated that only about 50% of the pulse-labeled protein was secreted from the 
cell during a 24 hr period, and that labeled precursor appeared to remain in a stable 

30 intracellular pool for more than 24 hr. The kinetics and pattern of secretion observed 
for TNFR-IgGl were similar to those previously reported for two other secreted 
variants of membrane glycoproteins (Berman et al.., (1989) J. Virol. 63: 3489-3498) 
suggesting that secretion of TNFR-IgGl was similarly inefficient. 
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Endoglycosidase digestion and immunofluorescence studies similar to those 
described by Machamer and Rose (1988) J. Biol. Chem. 263:5955-5960) showed 
that the intracellular TNFR-IgGl could be localized to the ER and was sensitive to 
endoglycosidase H digestion (endoH). These studies further showed that secreted 
5 TNFR-IgGl was resistant to endo H digestion and sensitive to neuraminidase 
digestion. Together these results suggested that the intracellular TNFR-IgGl 
accumulated in the endoplasmic reticulum (ER) and possessed the characteristic 
high mannose form of N-linked carbohydrate, whereas the secreted protein 
possessed terminal sialation acquired in the trans-Golgi apparatus (tGA). 
10 EXAMPLE 2 

Deletion of glycosylation sites can impair secretion of TNFR-IgGl . To 
examine the effect of sequential deletion of glycosylation sites on the secretion of 
TNFR-IgGl, further experiments were carried out with a series of glycosylation 
mutants (Table 1). 
15 Table 1 

Summary of TNFRl-IgGl Glycosylation Mutants 
Amino acid position Secretion Efficiency 

14 105 111 248 



20 



25 



30 



N 


N 


N 


N 


+++ 


Q 


N 


N 


N 


++ 


N 


Q 


Q. 


N 


+ 


N 


N 


N 


Q 


+++ 


N 


N 


Q 


N 


+ 


N 


Q 


N 


N 




Q 


N 


N 


Q 


+ 


N 


N 


Q 


Q 


+ 


Q 


Q 


Q 


N 


+/- 


Q 


Q 


Q 


Q 




N 


s 


N 


N 


+++ 


N 


. N 


S 


N 


+ 


N 


D 


D 


N 


+ 


N 


K 


K 


N 


++ 



-29- 



N S S N 

N R R N + 
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In these studies, mutagenesis primers were designed to replace the codon specifying 
asparagine (N) in the N-linked carbohydrate attachment seqon, N-X-S/T, with 

10 codons specifying glutamine (Q). Because glycosylation sites 2 and 3 in TNFR- 
IgGl are only a few amino acids apart (residues 105 and 1 1 1), a single mutageneis 
primer could be used to mutate both sites simultaneously. We found that all of the 
glycosylation mutants were expressed, but there were significant differences in 
secretion efficiency (Figure 4A). Removal of the fourth glycosylation site (residue 

15 248), located in the Fc domain of TNFR-IgGl (NNNQ) had no effect on secretion, 
relative to wild type TNFR-IgGl (NNNN). In contrast, removal of glycosylation 
sites 2 and 3 (NQQN) resulted in a nearly complete blockade of secretion. Deletion 
of the first glycosylation site (QNNN) also blocked secretion, however in this case 
secretion was approximately 35% of that seen with the fully glycosylated protein. 

20 Removal of the first three glycosylation sites (QQQN) completely inhibited 
secretion as did removal of all four glycosylation sites (QQQQ). 

Further mutants were constructed (Table 1) to study, independently, the 
influence of glycosylation sites 2 and 3 on secretion. It was found that removal of 
glycosylation site 3 alone (NNQN) allowed for secretion at approximately 10% of 

25 level seen with the fully glycosylated protein, whereas deletion of glycosylation site 
2 alone (NQNN) completely abolished secretion (Figure 4B). Deletion of 
glycosylation sites 3 and 4 (NNQQ) resulted in a secretion efficiency (Figure 4C) 
that was similar to the NNQN mutant but lower than the NNNQ mutant (Figure 4B). 
Deletion of glycosylation sites 1 and 4 (QNNQ) resulted in significantly lower 

30 levels of secretion (Figure 4C) than was observed with the QNNN and NNNQ 
mutants alone (Figure 4A). 
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EXAMPLE 3 

Nature of the amino acid side chain used to mutate N linked glycosylation 
site can effect secretion efficiency. Additional studies were carried out to determine 
if the nature of the amino acid side chain used for replacement of the N residue in 
5 the sequence creating an N-linked glycosylation site affected secretion efficiency. 
Each N-linked oligosaccharide chain can possess two to four sialic acid residues, 
depending on the antennarity of the high mannose precursor. We reasoned that 
replacement of N at N-linked glycosylation sites with charged residues (e.g. aspartic 
acid, glutamic acid, lysine, arginine) might have less effect on conformation than 

10 replacement with hydrophilic uncharged side-chains (e.g. glutamine, threonine, 
serine). We found that replacement of the N residues at glycosylation sites 2 and 3 
with acidic, aspartic acid residues (NDDN) or basic, lysine residues (NKKN) 
allowed for better secretion (Figure 4B) than the uncharged glutamine residues in 
the (NQQN) construct (Figure 4A); however, the amount of protein secreted was far 

1 5 less than that observed with the wild-type TNFR-IgGl protein. Replacement of 

asparagine at sites 2 and 3 with arginine (NRRN), also gave better secretion than the 
NQQN mutant, but the total amount of secreted protein appeared to be less 
compared to the NDDN or NKKN mutants (Figure 4D). 

We next considered the effect of uncharged, hydrophilic side-chains, serine 

20 and threonine, on the secretion efficiency of TNFR-IgGl . We found that 

replacement of glycosylation sites 2 and 3 with serine (NSSN) or threonine (NTTN), 
like the glutamine mutants (NQQN) completely inhibited secretion (Figure 4D). 
However, single substitutions of serine for asparagine at glycosylation sites 2 and 3 
with serine gave different results (Figure 4D). The replacement of serine for 

r 

25 asparagine at glycosylation site 2 (NSNN) allowed for better secretion than any of 
the other amino acid replacements examined at this position. In contrast, 
replacement of serine for asparagine at glycosylation site 3 (NNSN) resulted in 
secretion similar to that observed in the NNQN mutant (Figure 4B). 

EXAMPLE 4 

30 Triple deletion mutants. To investigate the effect of glycosylation site 4 in 

combination with sites 1-3, a series of mutants were constructed where three of the 
four glycosylation sites were deleted (Table 1). It was found that the NDDQ, the 
NKKQ, the NDDN and NKKN mutants were all secreted from transfected cells 
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(Figure 4C). However, in all four cases the amount of protein secreted was a small 
fraction (approximately 10%) of the amount secreted by fully glycosylated TNFR- 
IgGl . Thus, removal of glycosylation site 4 did not alter the secretion efficiency of 
variants where asparagine residues at glycosylation sites 2 and 3 were replaced by 
5 lysine or aspartic acid. A final mutant where three of the four glycosylation sites 
were deleted (QSNQ) was also examined (Table 1, Figure 5). This variant 
accumulated intracellular^ and was not secreted. 

EXAMPLE 5 

Secretion efficiency of immunoadhesins can be improved by altering the 
10 signal sequence. Studies were performed to demonstrate that the yield and secretion 
efficiency of a representative immunoadhesin, TNFR-IgGl, from stably transfected 
CHO cells can be improved by replacement of the TNFR-IgGl signal sequence with 
that of human tissue plasminogen activator (tPA). In these studies the signal 
sequences of herpes simplex virus type 1 glycoprotein D (HSV gD-1), Met-Gly-Gly- 
15 Thr-Ala-Ala-Arg-Leu-Gly-Ala-Val^ 

Val-Arg-Gly (SEQ ID NO: 9), recombinant human DNase, Met-Arg-Gly-Lys-Leu- 
Leu-Gly-Ala-Leu-Leu-Ala-Leu-Ala-Ala-Leu-Leu-Gln-Gly-Ala-Val-Ser (SEQ ID 
NO: 10), HER-2, Met-Gly-Trp-Ser-Cys-Ile^ 

Gly-Val-His-Ser (SEQ ID NO: 1 1), TNFR, Met-Gly-Leu-Ser-Tto>Val-Pro-Asp-Leu- 
20 Leu-Leu-Pro-Leu- Val-Leu-Leu-Glu-Leu-Leu-Val-Gly-Ile-Tyr-Pro-Ser-Giy-Val-Ile- 
Gly (SEQ ID NO: 2) (Ashkenazi et al., (1991) Proc. Natl. Acad. Sci. 88: 10535- 
10539), and t-PA (SEQ ID NO: 1) were operably linked to the nucleic acid sequence 
encoding TNFRl-IgGl and the secretion kinetics were followed. The results 
indicated an increase in amount of TNFR-IgGl that could be recovered from the 
25 supernatant as well as an improvement in the secretion kinetics (2 hrs and 24 hrs). 
To determine whether the tPA signal/pro-sequence improved yield by 
increasing the efficiency of intracellular transport, a plasmid (pRK.tPA-TNFR- 
IgGl) was constructed for transient transfection studies in 293 cells. Pulse-chase 
studies (Figure 2, Figure 3 A and B) showed that replacement of the TNFR signal 
30 sequence with the tPA signal/pro sequence resulted in a significant improvement in 
secretion efficiency. Thus, after 24 hours, approximately 70-80% of the pulse- 
. labeled TNFR-IgGl was secreted into the cell culture medium when the tPA 
signal/pro sequence was included, whereas only 50-60% of the protein was secreted 
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using the wild-type TNFR signal sequence. The increased efficiency of secretion 
achieved with the tPA signal/pro sequence was apparent from the kinetics of 
secretion of TNFR-IgGl (Figures 2, Figures 3 A and B). When the TNF signal/pro 
sequence was used, only 40% of the pulse-labeled TNFR-IgGl was secreted in 2-4 
5 hr after pulse labeling. However when the TNF signal sequence was replace with 
the tPA signal/pro sequence approximately 60% of the TNFR-IgG synthesized with 
the TNFR signal sequence was secreted in this time frame. 

Studies of the biosynthesis of tPA have shown that the N terminal processing 
of tPA is complex and involves co-translational cleavage of a 21 amino acid signal 

10 sequence, post-translation cleavage of an 1 1 amino acid pro-sequence at a furin 
cleavage site, and extracellular cleavage of a 3 amino acid, N-terminal peptide by 
an undefined exopeptidase (Figure 1). This complex strategy for N terminal 
processing of TNFR-IgGl mutants is supported by our results showing that 
intracellular TNFR-IgGl variants possessing the tPA signal/pro sequence possessed 

1 5 higher molecular weights than the corresponding secreted proteins (Figure 5 A). 

Since furin is known to be localized in the trans-GA, the pro-sequence would not be 
expected to be removed until the tPA-TNFR precursor has been transported from the 
ER to the trans-GA. Endoglycosidase digestion studies and immunofluorescence 
studies have provided data consistent with an ER localization of intracellular 

20 variants of tP A. TNFR-IgG 1 . 

EXAMPLE 6 

Secretion efficiency of glycosylation site mutants can be improved by 
altering the signal sequence! To determine whether signal sequence exchange could 
improve the secretion efficiency of the TNFR-IgGl glycosylation site mutants 
25 described in Figure 4, the TNFR signal sequence was deleted and replaced by the 
tPA signal/pro sequence. It was found (Table 2, Figure 5) that the signal sequence 
exchange resulted in a marked increase in secretion efficiency of these glycosylation 
site mutants. 
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Table 2 

Effect of tPA Signal Sequence or Secretion of TNFRl-IgGl Glvcosvlation Mutants 

% Secretion Efficiency* EC50 for 
TNF Binding ! 

5 

Mutant TNFR.ss tPA.ss (nM) 

NNNN 50 70 6.66+/- 0.73 

10 N N Q Q 20 70 4.74 +/- 0.48 

N S N Q <5 60 6.94 +/- 0.83 



15 



NKKQ 10 40 ND 

Q S N Q <5 65 2.34 +/- 0.22 



QQQQ <5 <5 ND 

* Data represent results from pulse-chase experiments (e.g. Figure 3) where the 
20 percentage of pulse-labeled protein secreted in a 24 hr period was measured by 
scanning densitometry. 

JData represent EC50 values for the displacement of ( 125 I)-labeled TNF by TNFR- 
IgGl glycosylation site mutants (Figure 7). 

For example, only 10-20 % of the NNQQ mutant was secreted using the 

25 TNFR signal sequence, whereas 60-70% of the NNQQ mutant containing the tPA 
signal/pro sequence was secreted (Figures 3C and D, Figure 5). Similarly, little or 
none of the QSNQ mutant was secreted containing the TNFR signal sequence, but 
approximately 60% was secreted from the tPA signal/pro sequence containing 
variant (Figures 3E and F, Figure 5). Similar improvements in secretion efficiency 

30 were observed for the NKKQ and NSNQ mutants (Figure 5). Densitometric analysis 
of pulse chase experiments (Figure 3) showed that attachment of the tPA signal/pro 
sequence accelerated the kinetics of intracellular transport as well as increasing the 
total amount of secreted protein. Although replacement of the tPA /pro sequence 
could overcome the blockade of protein secretion for many of the glycosylation 
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mutants, this strategy was not effective for the fully glycosylated QQQQ 
glycosylation site mutant (Figure 5). 

EXAMPLE 7 

Structure and TNF binding activity of TNFR-IgGl glycosylation mutants. 
5 Although the signal sequence exchange strategy described above provided a method 
that allowed for the secretion of TNFR-IgGl glycosylation site mutants, we 
wondered whether the secreted proteins were properly folded. To answer this 
question, stable CHO cell lines, expressing several of the TNFR-IgGl glycosylation 
mutants, were constructed in order to produce sufficient quantities for ligand binding 

10 studies. Recombinant proteins representing the NNQQ, NSNQ, and QSNQ mutants 
were purified from growth conditioned cell culture medium using protein A affinity 
chromatography. It was found (Figure 6) that the glycosylation site mutants, like 
TNFR-IgGl, associated to form covalent disulfide bonded dimers that were stable 
in SDS and could be dissociated by the addition of reducing agents (e.g. 2- 

1 5 mercaptoethanol). As expected, the molecular mass of the glycosylation variants 
correlated with the number of intact glycosylation sites. Thus the variants with two 
glycosylation sites deleted (NNQQ and NSNQ) were smaller in molecular weight 
than the wild type (NNNN) proteins and larger in molecular mass than the QSNQ 
variant the possessed only one glycosylation site. 

20 TNF binding studies were carried out to determine whether the secreted 

glycosylation variants were folded into a functionally relevant conformation. For 
this purpose, the three glycosylation mutants illustrated in Figure 6 were evaluated 
in a binding assay (Figure 7) using ( 125 I)-labeled TNF by a method similar to that 
described by Ashkenazi et al. All three mutants bound the ( 125 I)-labeled TNF with 

25 EC 50 values (Table 2) comparable to that observed for fully glycosylated TNFR- 
IgGl . These studies suggested that neither glycosylation sites nor the tPA signal/pro 
sequence affected the interaction of TNFR-IgGl with it's ligand. 
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EXAMPLE 8 

To determine the influence of the tPA propeptide on secretion kinetics of 
tPA/TNFR-IgGl a construct was generated containing the 29aa TNFR signal 
sequence (SEQ ID NO: 2) and a 13 amino acid t-PA propeptide (SEQ ID NO: 7) 
5 having the sequence Met-Gly-Leu-Ser-Thr-Val-Pro-Asp-Leu-Leu^ 
l^u-I^u-Glu-Leu-Leu-Val-Gly-Ile-Tyr-Pro-Ser-Gly-Val-Ile-Gly^ 
His-Ala-Arg-Phe-Arg-Arg-Gly-Ala-Arg(SEQ ID NO: 8) was prepared. From RIP 
experiments SEQ ID NO: 8 was shown to have similar t 1/2max and percent secretion 
at 24 hours to that of tPA/TNFR-IgGl (Figure 8 A-C). Endo H digest also reveal 

10 similar secretion kinetics to that of tPA/TNFR-IgGl, where approximately 1/3 of the 
lysate is endo H resistant between 10 and 30 minutes and is probably secreted within 
30 minutes, as opposed to the wild type TNFr-IgG in which very little of the lysate 
is Endo H resistant and the band does not decrease over time. This evidence 
strongly indicates that the decrease in t 1/2max and percent secretion at 24 hours for 

1 5 tPA/TNFR-IgGl is due to the presence of the 13 amino acid propeptide. This is 

supported by transient expression data, wherein 14 yields similar total expression as 
well as specific productivity. 
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